Tree Edit Distance Cannot be Computed in Strongly Subcubic Time (unless APSP can)

نویسندگان

  • Karl Bringmann
  • Pawel Gawrychowski
  • Shay Mozes
  • Oren Weimann
چکیده

The edit distance between two rooted ordered trees with n nodes labeled from an alphabet Σ is the minimum cost of transforming one tree into the other by a sequence of elementary operations consisting of deleting and relabeling existing nodes, as well as inserting new nodes. Tree edit distance is a well known generalization of string edit distance. The fastest known algorithm for tree edit distance runs in cubic O(n) time and is based on a similar dynamic programming solution as string edit distance. In this paper we show that a truly subcubic O(n3−ε) time algorithm for tree edit distance is unlikely: For |Σ| = Ω(n), a truly subcubic algorithm for tree edit distance implies a truly subcubic algorithm for the all pairs shortest paths problem. For |Σ| = O(1), a truly subcubic algorithm for tree edit distance implies an O(nk−ε) algorithm for finding a maximum weight k-clique. Thus, while in terms of upper bounds string edit distance and tree edit distance are highly related, in terms of lower bounds string edit distance exhibits the hardness of the strong exponential time hypothesis [Backurs, Indyk STOC’15] whereas tree edit distance exhibits the hardness of all pairs shortest paths. Our result provides a matching conditional lower bound for one of the last remaining classic dynamic programming problems. The collaboration on this paper was established during Dagstuhl seminar 16451. ∗Max Planck Institute for Informatics, Saarland Informatics Campus †University of Haifa. Partially supported by the Israel Science Foundation grants 794/13 and 592/17. ‡IDC Herzliya. Partially supported by the Israel Science Foundation grants 794/13 and 592/17.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Subcubic Equivalences Between APSP, Co-Diameter, and Other Complementary Problems∗†

Despite persistent effort, there is no known technique for obtaining super-linear lower bounds for the computational complexity of the problems in P. Vassilevska Williams and Williams [38] introduce a fruitful approach to advance a better understanding of the computational complexity of the problems in P. In particular, they consider All Pairs Shortest Paths (APSP) and other fundamental problem...

متن کامل

Subcubic Equivalences Between Graph Centrality Problems, APSP and Diameter

Measuring the importance of a node in a network is a major goal in the analysis of social networks, biological systems, transportation networks etc. Different centrality measures have been proposed to capture the notion of node importance. For example, the center of a graph is a node that minimizes the maximum distance to any other node (the latter distance is the radius of the graph). The medi...

متن کامل

Fast Computation of the Tree Edit Distance between Unordered Trees Using IP Solvers

We propose a new method for computing the tree edit distance between two unordered trees by problem encoding. Our method transforms an instance of the computation into an instance of some IP problems and solves it by an efficient IP solver. The tree edit distance is defined as the minimum cost of a sequence of edit operations (either substitution, deletion, or insertion) to transform a tree int...

متن کامل

The Intractability of Computing the Hamming Distance

Given a string x and a language L, the Hamming distance of x to L is the minimum Hamming distance of x to any string in L. The edit distance of a string to a language is analogously defined. First, we prove that there is a language in AC such that both Hamming and edit distance to this language are hard to approximate; they cannot be approximated with factor O(n 1 3 − ), for any > 0, unless P =...

متن کامل

An Improved Algorithm for Tree Edit Distance Incorporating Structural Linearity

An ordered labeled tree is a tree in which the nodes are labeled and the left-to-right order among siblings is significant. The edit distance between two ordered labeled trees is the minimum cost of transforming one tree into the other by a sequence of edit operations. Among the best known tree edit distance algorithms, the majority can be categorized in terms of a framework named cover strateg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018